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We study the properties and production mechanisms of color-octet scalars at the LHC. We focus on 
O : the stogle product of both charged and „e„„,l members of ,„ (8, 2, I/2 double, rhrough bodom 

C\j . quark initial states. These channels provide a window to the underlying Yukawa structure of the 

^ ■ scalar sector. Color-octet scalars naturally appear in grand unified theories based on the SU (5) gauge 

f"*^ ' symmetry. In the context of adjoint SU (5) these fields are expected to be light to satisfy constraints 

C*") ' coming from unification and proton decay, and may have Te V-scale masses. One combination of their 

couplings is defined by the relation between the down-quark and charged-lepton Yukawa couplings. 
Observation of these states at the LHC gives an upper bound on the proton lifetime if they truly 
I 1 arise from this grand unified theory. We demonstrate that TeV-mass scalars can be observed over 

D ( background at the LHC using boosted top quark final states, and study how well the scalar Yukawa 



parameters can be measured. 
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O ' Many motivations exist for physics beyond the Standard Model (SM) of particle interactions. Much of 

00 ' 

O ■ the research in high-energy physics for the past decades has been driven by the principle of naturalness. This 



I. INTRODUCTION 



idea states that widely disparate scales in Nature, such as the electroweak and Planck scales, should not exist 
without some symmetry principle or dynamical mechanism to explain their ratio. Additional motivations to 
believe in physics beyond the SM have recently been found. These include the experimental discovery of 
neutrino masses and the strong indication of Te V-scale dark matter. 

Recent study of physics beyond the SM has focused on the TeV scale, both because of dark matter and 
because of the imminent start of the experimental program at the Large Hadron Collider (LHC). Significant 
effort has focused on brainstorming all possible scenarios that might be observable at the LHC. Two possible 
questions to ask are the following. Are there forms of new physics whose observability at the LHC has not 
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been studied? Do current experimental data motivate certain types of TeV-scale physics? Several interesting 
answers to the second question have recently been given. An example relevant to the discussion here is the 
experimental absence of flavor-changing neutral currents, which suggests that new physics should satisfy 
the principle of minimal flavor violation (MFV). This states that the flavor structure of physics beyond 



the SM should be completely determined by the SM Yukawa structure HI]. Another question that arises 
is whether measurements at the LHC can be used to understand details of high-scale theories. Both the 
seesaw mechanism for neutrino masses and the near unification of SM gauge couplings suggest new physics 
near 10 15 GeV. Effort has been devoted to determine whether LHC results can indirectly probe these high 
energies. 

An interesting scenario of physics beyond the SM suggested by the above questions is the possibility of 
a scalar sector containing more than the single Higgs doublet of the SM. One motivation for considering 
extended scalar sectors is purely phenomenological. Additional scalars can cause large deviations in the 
branching fractions of the standard Higgs boson, modifying experimental search strategies [2J]. Their LHC 
signatures have not been extensively studied. Another motivation is that certain seldom studied scalar 
representations naturally satisfy the principle of minimal flavor violation (MFV). A recent analysis showed 
that scalars transforming as either (l,2) 1 / 2 or (8,2) 1 / 2 under the gauge group SU(3)c ® SU(2)l ® 
U(l)y can naturally have fermionic couplings proportional to the SM Yukawa structure [3]. While the first 
representation is just that of the Higgs doublet, the second denotes a color-octet scalar boson not found in 
the SM. 

Color-octet scalars also potentially provide a window to high-scale physics. The unification of the gauge 
interactions is one of the main motivations for physics beyond the SM, and it occurs naturally within Grand 
Unified Theories (GUTs). Recently, in the context of adjoint SU(5) |4], the possibility of light color 
octets in agreement with the constraints coming from the unification of gauge couplings and proton decay 
was pointed out Q5[] . The couplings of these states may therefore probe physics at the GUT scale. While 
observation of color-octet states at the LHC would not prove the existence of a GUT in Nature, it would 
offer the interesting possibility of correlating measurements from the LHC with future searches for proton 
decay. 

nnn 

Several studies regarding the discovery of color-octet scalars at the LHC have been performed |3lla,LZ 
9)]. The most promising channel appears to be pair production of either two neutral scalars or a charged pair 
through the partonic channel gg — > SS, where S denotes either a neutral or charged state [6Q. Discovery of 
color-octet scalars of at least 1 TeV and likely much higher is possible at the LHC. As the QCD couplings of 
the scalars to the gluon are fixed by their octet representation, this mode has the advantage of being model- 
independent. While this is a nice feature for discovery of the state, it does not allow the structure of the 
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Yukawa couplings to be measured. The situation is similar to the study of the top quark at the Tevatron. The 
discovery mode pp — ► it doesn't probe the underlying Yukawa structure of the top quark. The measurement 
of Vtb requires the study of single top production. 

In analogy with single top production, we investigate the use of single production of charged scalars 
through the partonic channel bg — > S^t to probe the Yukawa structure of the (8, 2) 1 / 2 scalar and the 
connection to the relation between Yj, and Y T . It is likely that the dominant decay of the scalar is through 
S + — > tb, leading to the production process pp — > S + i + S~t — ► tib + tib. We expect the dominant 
background to this signal to be ttj, as we discuss later in more detail. We perform an analysis indicating that 
the signal can be seen over background at the LHC, and identify the relevant kinematic features that make 
this possible. We also study neutral scalar resonance production through the partonic channel bb — > S — > tt, 
which can become sizable for a large scalar Yukawa coupling to the bottom quark. This channel provides 
another window into the Yukawa sector of the color-octet scalar. We illustrate that this can also be observed 
over background at the LHC. We discuss how well the scalar Yukawa couplings can be measured in these 
channels. 

Our paper is organized as follows. We present our notation and setup, and explain how color-octet 
scalars provide a window to GUT-scale physics, in Section|IIl We discuss the various production and decay 
mechanisms for color-octet scalars in Section |III] In Section [TV] we describe the details of our simulation 
procedure. We apply this analysis procedure to the study of single charged scalar production and neutral 
scalar production through the bb initial state in Section [V] and demonstrate how to separate signal from 
background in these two channels. We conclude in Section[VTJ Analytic expressions for the production and 
decay modes used in the analysis are given in the Appendix. 



II. MFV, GRAND UNIFICATION, AND LIGHT COLOR OCTETS 

We first present some basic results relevant for studying the scalar octets at the LHC. We only discuss 
issues which directly affect our analysis; details on other aspects of color octets can be found in Ref. yfl. 



A. Minimal flavor violation and light octets 

We consider an extension of the Standard Model where the scalar sector is composed of the SM Higgs, 
H ~ (l,2)]/ 2 , and a color octet, S ~ (8, 2) 1 / 2 . In this case the extra Yukawa interactions due to the 
presence of the octet are given by 

C Y = d R T D St Q L + u R Tu Qt SP e aP + h.c, (II. 1) 
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where the matrices Ul, Ur, Di and Dr diagonalize the mass matrices for quarks. denotes the charged 
octet scalar and S RI are respectively the CP-even and CP-odd neutral scalars. If we assume minimal 
flavor violation [3], then 



IV = rfrYu and F D = i] D Y D . 



(II.4) 



In this case the physical interactions are 
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where Vckm is the Cabibbo-Kobayashi-Maskawa matrix, v is the SM Higgs vev, u and d are respectively 
the SM up- and down-type quarks, and mu, are their masses. r]u and rjjj are parameters that describe 
the strength of the scalar couplings to matter. 

The mass splittings between the charged state and the neutral members of the (8, 2)i/ 2 depend on the 
details of the potential describing their self-interactions and their coupling to the regular Higgs doublet. If 
the splitting is sufficiently large, decays such as — > S°W il are allowed, where S° denotes one of the 
neutral scalars. Otherwise, the charged scalar will decay predominantly via S + — > tb due to the couplings 
in Eq. III. 5 1 An analysis of the scalar-potential parameter space allowed by current experimental constraints 
reveals that scalar cascade decays are unlikely We will assume in our study that the charged scalar 



decays only to tb pairs. 
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B. Adjoint SU(5) and light color octets 

The existence of the color octets mentioned above can also be understood in the context of the simplest 
grand unified theories based on SU(5) lioll . For a review on grand unified theories and their phenomenolog- 
ical aspects, see Ref. jllll . In Ref. jl2ll it was realized that in order to have a consistent relation between the 
charged lepton and down quark masses in the context of renormalizable SU(5) theories, a new Higgs in the 
45 representation must be introduced. In these models the Higgs sector is composed of the representations 
5h, 24// and 45//, and the relevant Yukawa interactions are given by 



S- 



Yukawa 



d A x \ Yi 10 5 5^ + Y 2 W 5 45^ + Y 3 10 10 5 H + Y 4 10 10 45// + h.c.(II.6) 



where 5 = (d , and 10 = (u , Q, e c )i- The masses of the SM charged fermions are 



T V A5 
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and M V = 4(Y 3 + Y 3 T )^= - 8 (Y 4 - if ) ^ = M? + Mg, 



(n.7) 

(DL8) 
(II.9) 



with v 5 /V2 = (5 H ), and = (45 H )\ 5 = (^ H )f 



(45 H )f. 



The color octet studied in Ref. |3|] lives in the 45 representation of the SU(5) theory, S C 45//. We 
note that S = $1 in the notation of Ref. ||4ll5|l- Using the above Yukawa interactions we find the following 
interactions between the SM fermion and the octet: 



UR 



V2M 
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where 



(11.10) 
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(45//)f --5) (45//); 

the indices i,j = 1,2,3 and a = 4, 5. The couplings in Eq. (III. II ) are then denned by 

„ _M E -M T D 



and 



^45 2^45 

As pointed out in Ref. jl3], this model has only one light Higgs doublet, and v\ + «| 5 



(11.11) 



(11.12) 
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Recently, a grand unified theory has been proposed where the neutrino masses are generated through the 
Type I and Type III seesaw mechanisms J4|]. This predicts the existence of light color-octet scalars. This 
issue has been studied in detail in Ref. 150, where it was concluded that in order to satisfy the constraints 
coming from proton decay and unification of gauge interactions, the color octets must be very light. The 



A. 



This result indicates it may be 



upper bound on the mass of this color octet is mj < 4.4 x 10 5 GeV 
possible to produce these exotic fields at the LHC. 

In order to investigate the couplings of the octets at the LHC and understand their connection to the GUT 
theory, we assume that at low energy the MFV hypothesis is valid. Neglecting mixings, we find 

v 

VD = -r^ {m T -m b ). (11.13) 

4u 45 m 6 



and U45 < v, the coupling rjd can 

be significantly larger than unity. Measurement of t]e> fixes V45, the vev of 45jj. Determination of rju then 
probes the Yukawa coupling Y4 that parametrizes the interaction between 45 h and up-type quarks, as can 
be seen using Eqs. UI.9l and UI.12l 

In order to ascertain that the color-octet scalar is coming from a GUT, information from other exper- 
iments is required. Observation of proton decay would constitute strong evidence of grand unification in 
Nature. Remarkably, an upper bound on the proton lifetime is predicted by measurement of the color-octet 
mass at the LHC. This is illustrated in Fig. [TJ which shows the relationship between the color-octet scalar 
mass mo, the GUT scale Mqtjt> and various other mass scales of the SU(5) theory (for more details see 
Refs. |4j,|5|]). The horizontal dashed line on this plot indicates the current constraint from non-observation of 
proton decay, while the diagonal dashed line denotes the requirement on GUT parameters needed to obtain 



successful leptogenesis H 1 3f1 . Regions in the upper-left portion of the figure are allowed by these constraints. 
Contours of constant color-octet scalar mass are horizontal lines in this figure. The important point to note 
is that the color-octet scalar mass fixes the maximum possible GUT scale, M™^. This determines an up- 
per bound on the proton lifetime, as the dominant contributions to this process are the super-heavy X and 
Y bosons of the SU(5) theory, whose couplings are fixed by unification. For example, if 771,5 = 1 TeV, 
« 6 x 10 15 GeV, leading to the following bounds on the partial lifetimes: 
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irs. (11.14) 
Measurement of ms at the LHC then gives an unavoidable prediction for the rate of proton decay. 



III. OCTET DECAYS AND PRODUCTION MECHANISMS AT THE LHC 



Several partonic production mechanisms can lead to color-octet scalar production at the LHC. Pair pro- 
duction processes such as gg — > S + S~ , SrSr, Si Si, which depend only on the QCD coupling, were 
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FIG. 1: Relationship between the color-octet scalar mass ms, the maximum GUT scale Mqijt, and various other 
mass scales of the SU(5) theory. The horizontal dashed line on this plot indicates the current constraint from non- 
observation of proton decay, while the diagonal dashed line denotes the requirement on GUT parameters needed 
to obtain successful leptogenesis. Regions in the upper-left portion of the figure are allowed by these constraints. 



Contours of constant color-octet scalar mass are horizontal lines in this figure. For further details see Refs. 



,4. 



studied in Ref. [6]. The gluon-initiated single-production processes gg — ► Srj, studied in Ref. 11511 . can 
be important for large rju- If r/o is larger than unity, then bb — ► Srj becomes significant. Charged scalars 
can be singly produced via the partonic interaction gb — > tS~ and its charge conjugate. Pair production of 
a charged and a neutral scalar is possible via an intermediate W boson, for example through the partonic 
process ud — > W + — > S + Srj, and is fixed by the SU(2)i quantum number of the scalar doublet. We 
focus here on the single-production processes bb — ► Srj and gb — ► tS~ which allow measurements of rju 
and rjD. 

As mentioned previously, we focus on top-quark decays of the scalars, S + — ► tb and Sr i — > tt, as a 
study has indicated that the mass splitting between the scalar states is likely not large enough for cascade 
decays such as — ► SrjW^ to be important |6fl. If both rju and r/o are small, then the scalars might be 
stable on collider length scales, leading to missing energy and highly ionizing tracks. Scalar bound states 
can also occur in this case near pair-production thresholds. We do not further pursue these possibilities here. 

The cross sections for the processes pp — ► S + S~ and pp — > tS~ + iS + are simple to derive given the 
interactions in Eq. dll.51 ). We present them below in Fig.|2]as a function of the scalar mass mj for r\ 



rju = 1. These are leading-order results obtained using the CTEQ6L1 partem distribution functions jlfjll 
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Two things should be noted from these results. First, the cross section for ms ~ 1 TeV is sufficiently 
large to allow study of the single scalar production mode. Second, the S + S~ production process decreases 
more quickly with rising scalar mass. This is simply due to the phase space behavior of each process. The 
di-scalar production process falls off as s — 4m|, while the single production decreases as s — (ms + mt) 2 , 
where s is the partonic center-of-mass energy. The single charged scalar production mode may therefore 
extend the search reach at high ms, although we do not pursue this possibility here. 

We also study the resonant production of Srj through the partonic process bb — ► Srj — > tt, which can 
become important for large values of t)d. For illustration we plot this cross section also in Fig. [2] assuming 
tjd = 40 and rju = 1. Analytic expressions for the bg — ► St, bb — > Srj, and gg — > S + S~ partonic cross 
sections, as well as for the scalar decays, are given in the Appendix. 
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FIG. 2: Inclusive cross sections for the processes pp — ► S + S~, pp — > tS~ + tS + , and pp — > Srj through bottom 
quark fusion at the LHC. We have set the renormalization and factorization scales to fi = mt + ms for single scalar 
production, to \i = 2ms f° r di-scalar production, and to \i = 7715 for single scalar production, and have used the 
CTEQ6L1 parton distribution functions. 



IV. DETAILS OF SIMULATION 

To discuss production and measurement of color-octet scalars at the LHC, we must determine how well 
the signal can be observed over background. We describe here our procedure for estimating this. 



We implement the charged and neutral scalars in MadEvent [17] using Feynman rules derived from 
Eq. (III.5I ). We set the renormalization and factorization scales on an event-by-event basis using fi R = \<? F = 
m\ + Y1iPt> where the sum runs over all objects in the final state including missing energy. After the scalar 
decay, the final state is tib + ttb. We decay the top quarks fully using the DECAY routine of MadEvent, 
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which does not preserve spin correlations. MadEvent allows spin correlations to be kept throughout the 
entire decay chain; we have checked on a smaller sample than that used in our full analysis, that they have a 
negligible effect given the cuts we use. We set mg = 1 TeV to illustrate the analysis procedure. We discuss 
later our simulation of the backgrounds. We compute all cross sections at leading order in QCD; while 
the next-to-leading order results for the backgrounds we will later consider are known, those for our signal 
processes are not. This is a conservative choice, as this procedure will tend to underestimate the obtainable 
statistical significance S/ \fB if the QCD corrections increase both cross sections roughly equally. 

The above procedure leads to sets of parton-level events for both signal and background. We now 
describe our procedure of applying a simple detector simulation to these events and finding a set of cuts to 
extract the signal. 



A. Detector simulation 

We perform a simple estimate of detector effects by applying the following four actions to our parton- 
level events. 

• We model the acceptances of LHC detectors by discarding objects with \rj\ > 2.5. 

• To simulate the effect of imperfect energy resolution, we subject each particle in the final state to 
a Gaussian smearing of either its energy E or its transverse momentum px- We use the following 
smearing parameters. 

- jet-like objects (light quarks, b quarks, gluons): ^ = y © 0.03. 

- electrons: ^ = , ai © 0.007. 

- muons: = 0.15 p T /TeV © 0.005. 



These values are consistent with LHC expectations 11811 . After all final state objects have been 
smeared, the transverse momentum is recalculated. The p x and p y components of the missing energy 
are set to those values which ensure momentum conservation. 

We combine partons using a cone algorithm with AR = 0.4. These combined objects are then 
considered jets. 

We mimic LHC capabilities for tagging heavy flavor jets by assuming 6-jets are tagged with 60% 
efficiency, and that other jets fake 6-jets with a 3% fake rate. 
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An alternative to the above smearing of parton-level events is to run these events through the PYTHIA 
parton-shower ifl^j and PGS detector simulation |2(]]. We have tested that using this other technique of 
showering the parton-level events with PYTHIA and then using PGS to reconstruct observable objects 
leads to qualitatively similar results as found with the procedure described above; a slight degradation of 
our figure of merit S/yB occurs with the alternate procedure. We present results using the smearing 
technique described above due to its simplicity and transparency. 



B. Top reconstruction 



Since the final state contains two top quarks, their reconstruction becomes an important part of the 
analysis. With ms = 1 TeV, both tops in the signal sample will be highly boosted. These should then 
decay into a collimated b quark and W boson. This will be instrumental in constructing a top-finding 
algorithm. We study the reconstruction only of events where one top quark decays leptonically or both 
decay hadronically, and we do not attempt to reconstruct r leptons. If there is a single lepton present in the 
final state, the W can be reconstructed using the lepton and missing energy and then combined with a 6-jet 
to reconstruct the top quark. If there are no leptons in the final state, one can look for jets or combinations 
of jets with an invariant mass near that of the top. Several recent studies have used similar methods for 



reconstructing highly boosted top quarks 12 ll 
semi-leptonic and hadronic top algorithms. 



22j, 



23,24,23,26, 



2711 . We describe below the details of our 



1. Semi-leptonic top reconstruction 



The semi-leptonic algorithm is applied when there is only one lepton in the final state, either an electron 
or muon. Motivated by the significant boost of the top quark, we search for the jet with the smallest AR = 
y A<p 2 + Ar/ 2 separation from the lepton. We do not require the jet to be 6-tagged. If the separation between 
the lepton and jet is greater than AR max = 0.6, the event is passed on to the hadronic top reconstruction 
algorithm. Otherwise, the lepton and missing energy are combined such that the longitudinal component 
of the missing energy is constrained by the W mass. There is a two-fold ambiguity in determining the 
longitudinal momentum; the degeneracy is broken by choosing the solution with the smallest AR separation 
between the reconstructed W and the previously chosen jet. If the invariant mass of the combined jet, lepton, 
and missing energy falls within a window 171 ± 50 GeV, the combined object is labeled a top quark. 
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2. Hadronic top reconstruction 



When no leptons are present after the semi-leptonic algorithm is performed, a purely hadronic algorithm 
attempts to reconstruct other top quarks in the event. If the invariant mass of any single jet falls within 
the top mass window 171 ± 50 GeV, indicating the possibility that all of the top decay products have been 
combined by the jet algorithm, this jet is tagged as a top quark. Next, the two jets with the smallest AR 
separation are found, again imposing a maximum value AR max = 0.8. If the invariant mass of these two 
jets falls within the top mass window, they are combined and tagged as a top quark. If the invariant mass of 
the two jets falls within the range 40-121 GeV, indicating that they may be the decay products of a W boson, 
they are combined provisionally, and the jet with the smallest ARwj < Ai? max is found. If the invariant 
mass of the W candidate and the third jet falls within the top mass window, the three jets are combined and 
tagged as a top quark; otherwise, the combination of the first two jets to form the W candidate is undone. 
This algorithm continues until no further top quarks are found. 



V. ANALYSIS 



A. Single charged scalar production 



We now present results obtained by applying the above analysis procedure to the analysis of single 
charged scalar production via the partonic process gb — > St — ► ttb, where b, t can represent either quarks 
or antiquarks. We discuss the kinematic features that allow separation of signal from background, and esti- 
mate the obtainable statistical significance. We use S/ y/~B as our statistical measure to determine whether 
the signal is observable over the statistical fluctuation in the expected background. 



The signal is proportional to the coupling 1 77^7 1 4 /r,s' ~ \r]u\ 2 - We set rjo 



1 in this analysis. 



Previous studies of semi-leptonic tt events at the LHC by the ATLAS and CMS collaborations have indi- 
cated that the tt signal can be separated from the backgrounds of WW, WZ, ZZ, W + jets, and Z + jets, 



finding S/B = 65(27) with 10(l)fb- 1 118 



We therefore focus only on the ttj background when 
reconstructing semi-leptonically decaying tt pairs, assuming that the same holds true with the addition of 
an extra jet. Backgrounds to the fully hadronically decaying top pair are more dangerous, as QCD multi- 
jet production can lead to the same final state. Studies of fully hadronic tt reconstruction by ATLAS and 
CMS have found that S/B « 1/3 is achievable with lfb -1 [18] . Several recent studies have attempted 



to distinguish high invariant mass tt pairs from QCD backgrounds using jet mass and substructure differ 



ences 11261 



2711 . and have had some success. For example, the study of Ref. 12711 found that S/B m 1 is 



achievable by decomposing a hadronically decaying top quark into sub-jets and using the reconstruction of 
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two of these to My/', this is similar to our technique described in Section ITVB I Simulation of these pure 
QCD backgrounds is right now rather uncertain without LHC data to validate currently used Monte Carlo 
programs. In our analysis we simulate only the ttj background to the fully hadronic mode, and present two 
sets of results: one using only semi-leptonically reconstructed events, and another using both semi-leptonic 
and hadronic events. The actual LHC capability will likely be something intermediate between these two 
limits when QCD multi-jet backgrounds are included. As will become clear in the following discussion, the 
kinematic region of interest consists of three well-separated top and bottom jets, which validates our use of 
the ttj hard matrix elements without matching to a parton shower. We have checked this by passing our 
parton-level events through PYTHIA and PGS and finding similar results. 

Imposing only the cuts < 2.5 and p? > 20 GeV for b and light jets, the signal cross section 
pp — > S + i + S~t — > tib + tib is 90 fb, while the pp — > ti + jet background is 526 pb. Signal and 
background events were generated using MadEvent, and then decayed using the DECAY function. The 
analysis was performed on 10 5 signal events and 5 x 10 6 ttj events. Scales were set on an event-by-event 
basis such that n 2 R = fJ/p = ml + ^ p\, where the sum runs over all particles in the final state including 
missing energy. 

We first impose the following set of cuts motivated by the kinematics of the signal process. We demand 
that our top reconstruction algorithm find two top quarks, and in addition require another 6-tagged jet. We 
expect the top quark recoiling against the scalar to be very energetic, and we also expect the scalar decay 
products to be highly boosted. We also expect the signal to have a large s. This motivates the series of 
kinematic cuts presented in the upper half of Table U We have defined H? as the scalar sum of the Et 
of all visible final state objects and the px of the missing energy. t\ and &i are respectively the hardest 
top-tagged and 6-tagged jets, while £2 is the softer of the two top tags. We note that as this state can be 
discovered in the di-scalar channel, we can tune our cuts to the value of mg measured in this other mode, 
as we have done here for ms = ITeV in the invariant mass cut 900 GeV < Mjj 1>tl < 1100 GeV. The 
size of this interval is larger than the expected calorimeter resolution 1 18fl - These numbers are for 300 fb~ x 
of integrated luminosity. For illustration of the kinematic difference between signal and background, the 
Mfa t! distribution for both with arbitrary normalization is shown in Fig. [3] 

We can improve on the significance of the signal by using details of the event shape. The signal consists 
of a prompt top quark recoiling against the color octet scalar, which decays to another top quark and a 
6-jet. Since the scalar is heavy and is not produced with a large boost, its decay products will not typically 
be highly collimated with the prompt top quark. We expect the two top quarks and the 6-jet to form a 
"Mercedes Benz" topology, as shown in Fig. [4] However, we find that the ttj events that survive the above 
kinematic cuts have a very back-to-back t\, b\ pair. This motivates the further set of cuts in the lower half of 



13 



Cut Si Sh Bi Bh 

H T > 1000 GeV 4940 7820 8.00 x 10 5 1.26 x 10 6 

3h,t2,h 93 283 9610 2.85 x 10 4 

900 GeV < M blM < 1100 GeV 57 158 480 835 
AR bltl ,AR blt2 < 3.0 41 126 127 230 
cos 6 tlbl < 0.5, cos t2bl < 0.7 34 111 54 144 

TABLE I: Effects of cuts on cross sections; the upper half includes cuts motivated by the energy released in the 
interaction, while the lower half is motivated by the event topology. Si and Sh denote the expected numbers of semi- 
leptonic and hadronic signal events after cuts for 300 fb _1 of integrated luminosity; Bi and Bh denote the same for 
the background. 



m(tl,bl) m(tl,bl) 




500 1000 1500 2000 500 1000 1500 2000 

m (GeV) m (GeV) 



FIG. 3: Invariant mass M bl ^ tl for the charged octet signal (left panel) and the tij background (right panel). Included 
are the two top tags, one 6-tag, and Ht > 1000 GeV cut of TableQ] The normalization of the vertical axis is arbitrary. 
All scales on each plot are linear. 

Table H We have used 9^ to denote the lab frame angle between the top jets and the 6-jet. These cuts are 
designed to select events with the signal characteristics described above. For illustration of the kinematic 
differences between signal and background, the cos 9 t2bl distribution for both with arbitrary normalization 
is shown in Fig. [5] 

After imposing these cuts, we find the values of S/VB and S/B listed in Table ITT1 The successive 
improvement in both measures is evident as the cuts are imposed. We note that the large significances 
obtained with only the Ht cut are large because only the tij background has been considered. We believe 
that demanding the top and b tags is crucial in reducing other backgrounds in the fully hadronic mode. It 
may be possible to relax the tagging requirements in the semi-leptonic mode. Observation of this signal 
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t * b 



FIG. 4: "Mercedes Benz" topology of the charged scalar signal process, 
cos 0(bl,t2) cos 0(bl,t2) 




-1.0 -0.5 0.0 0.5 1.0 -1.0 -0.5 0.0 0.5 1.0 



FIG. 5: Lab frame polar angle cos 9t 2 b 1 for the charged octet signal (left panel) and the tij background (right panel). 
Included are the two top tags, one 6-tag, and Ht > 1000 GeV cut of Table J] The normalization of the vertical axis 
is arbitrary. All scales on each plot are linear. 

with 5o" significance requires slightly more than 300 fb" 1 if only semi-leptonic events are used, while less 
than 100fb _1 is sufficient for discovery if hadronic tops can be implemented in the analysis. We note 
that this is for the value r\\j = 1; the significance S/\fB scales as \rju\ 2 , while the required luminosity 
for 5a observation scales as l/|r/[/| 4 . Unless rju > 1, this is a search mode that requires a large amount of 
integrated luminosity. The purity of the sample after cuts, as measured by S/B, is reasonably high. We have 
not used all possible kinematic handles, so these results could likely be improved somewhat. For example, 
the invariant mass and pt distributions of the top quarks and bottom jet could be binned and compared 
between signal and background, allowing the shape differences to be more fully exploited. Our goal here is 
only to show that observation of the signal is feasible at the LHC. 

An observation of signal over background will permit measurement of the coupling \t]u\ 2 . rjjj can 
be expressed via GUT-scale parameters using Eqs. (111.41 ) and (III. 12b . The obtainable precision on this 
parameter depends on statistical uncertainties, and on how well the signal and background cross sections 
can be predicted. We assume that these errors can be approximately written as fractional uncertainties in 
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Cut Si/y/Bi Si/Bi Si+h/ V 'Bi + h Si + h/Bi + h 

H T > 1000 GeV 5.52 0.0062 8.89 0.0062 

3t 1 ,t 2 ,h 0.95 0.0096 1.92 0.0099 

900 GcV < M butl < 1100 GcV 2.60 0.12 5.93 0.16 
AR bltl ,AR blt2 < 3.0 3.60 0.32 8.82 0.47 

cos tlbl < 0.5, cos 9 t2bl < 0.7 4.56 0.62 10.3 0.73 

TABLE II: Obtained significances after successively imposing the cuts in Table U Si+h denotes the combination of 
semi-leptonic and fully hadronic signal events, and Bi+h denotes the same for the background. We assume 300 fb" 1 
of integrated luminosity; other values of L change only S/ \f~B~, and can be obtained from the numbers in the table by 



the scaling ^/L/300 fb" 1 . 

the signal and background cross sections, in which case the coupling uncertainty is 

5\rju\ 2 _ y/STB ®F S S @F B B 



(V.15) 



\vu\ 2 S 

Fs and Fb denote the fractional uncertainties for the St and ttj cross sections, respectively. We add the 
various components of the error in quadrature. We consider uncertainties arising from higher-order QCD 
effects, and begin our estimate of the errors by varying the scale choice in our calculation by a factor of two 



around our canonical value (J> = y m 2 + YlPr f° r b° tn S1 g na l an d background. These variations give some 
measure of the uncertainties in the leading order cross sections, which we denote as Fgg. For the inclusive 
cross section, we find F^° 0.15 and Fg « 0.40. The size of the variations are similar after the cuts are 
imposed, although the statistics and severity of the cuts for the background make it difficult to get a very 
precise estimate. Inserting these numbers into Eq. (IV. 151) . and assuming 3001b" 1 , leads to 5S/S = 0.72 
for only semi-leptonic decays and 5S/S = 0.58 when hadronic decays are added. Not only is a precise 
determination of the color-octet Yukawa impossible, it would be difficult to ascertain that a signal was even 
present. 

The above discussion clearly indicates that knowledge of the higher-order QCD corrections to both 
signal and background is required for interpretation of the measurement. The next-to-leading order QCD 
corrections to the signal are unknown, but are straightforward to calculate. We study below the precision in 
the signal rate that can be obtained assuming Fs = 0.05 and Fs = 0.10, which we consider reasonable es- 
timates of the residual error after next-to-leading order QCD corrections are included. The next-to-leading 



order QCD corrections to the ttj background are known [28]. However, a flexible numerical program that 



allows the imposition of phase-space constraints has not yet been released. In lieu of a better solution, we 
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estimate the uncertainty of our background using the inclusive cross section. We first note using Ref. j28ll 
that varying the leading-order cross section about a fixed scale by a factor of 2 also gives an approximate un- 
certainty of ±40%, as does the variation about the running scale used in our analysis. We then estimate that 
the residual scale variation of the next-to-leading order result is ±10-20%. Other handles on the normal- 
ization of this background will be available at the LHC. Significant statistics outside the signal region exist 
to normalize simulation predictions and determine the partonic luminosities that contribute. We consider 
below Fb = 0.10 and Fb = 0.20 as estimates of the obtainable precision in the background prediction. We 
have not included experimental systematics in our error estimate; these will certainly affect our results. It 
should be possible to gain control over these effects using the significant statistics outside the signal region. 

We present in Table ITTT1 the expected accuracy in the measurement of the coupling for the various error 
estimates discussed above. The limiting statistical precision for rju = 1 assuming 3001b- 1 is ±14% if 
all top modes can be used, and ±28% if only the semi-leptonic decays are included. With the possibly 
optimistic assumption that the background can be controlled with ±10% precision via a combination of 
next-to-leading order QCD and extrapolation from signal-free phase space regions, a precision of 19% in the 
color-octet coupling is possible with 300 fb -1 and rju = 1 using all top modes. This becomes 8% if rju = 2. 
The more pessimistic assumption of ±20% precision leads to measurements of 31% precision for r\u = 1 
and 10% for rju = 2. We caution the reader again regarding several omissions in our limited analysis; 
inclusion of experimental systematics will degrade the obtainable precisions, while more thoroughly using 
the shape differences between signal and background will improve the results. 



Fs Fb 


^€ (semi) 




i ' ,'.> (semi) 

\vv\ 


Cd) 




Vu = 


1 


Vu = 


2 





0.28 


0.14 


0.10 


0.05 


0.05 0.10 


0.33 


0.19 


0.12 


0.08 


0.05 0.20 


0.43 


0.31 


0.14 


0.10 


0.10 0.10 


0.34 


0.21 


0.15 


0.12 


0.10 0.20 


0.44 


0.32 


0.16 


0.13 


0.15 0.40 


0.72 


0.58 


0.24 


0.21 



TABLE III: Accuracy of signal measurement for single charged scalar production assuming various estimates of the 
errors in signal and background predictions. We have included results assuming 300 fb -1 of integrated luminosity for 
the choices rju = 1,2. Included are results using only semi-leptonic events (labeled semi) and results obtained using 
all events (labeled all). 
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B. bb — > S^,/ — > resonant production 

Another possible mode providing insight to GUT physics is resonant production of the neutral scalars 
through the partonic process bb — > Srj — > ti. This process is proportional to the coupling combination 
7/66 = \f]uVD\ 2 /^s- This production mode is normally suppressed by mf/v 2 , but can become sizable with 
large couplings. We take r\r> = 40 and rju = 1 to simulate this possibility. We note that the widths depend 
differently on t}u,d than the cross section, so different choices that leave the product \r]ur]D \ unchanged will 
lead to different results. 

An important constraint on the parameter |r/[/7/£>| is the decay b — > S7. Within the framework of minimal 
flavor violation, the overall coupling for this decay goes like rjijr]£)Vtb, where Vtb denotes the (3, 3) element 
of the CKM matrix. For tjd = 40, the constraints arising from b — > S7 on mj within minimal flavor 
violation are several TeV yO, pushing this state beyond the reach of the LHC. If minimal flavor violation is 
not assumed, then Vtb is replaced with an element of an arbitrary mixing matrix, which can be much smaller 
than Vtb ~ L permitting ms = 1 TeV. We assume m$ = 1 TeV to allow for this possibility at the LHC. 

Our study of this channel proceeds similarly to the charged scalar analysis. The leading order cross 
section for this process is 290 fb, several times larger than the loop-induced process gg — > Srj — ► ti studied 

previous studies that have indicated that other 



in Ref. 1150. We focus on the tt background, again relying on 



2211 and with the previously discussed caveats 



backgrounds can be suppressed in the semi-leptonic mode 1181 . 
on the fully hadronic mode. The cross section at leading order for ti production at the LHC is 640 pb. We 
present results for both the semi-leptonic and fully hadronic channels, as previous studies have shown that 



2711 . We slightly tweak the parameters in our top 



suppression of QCD multi-jet backgrounds is possible (26J, 
reconstruction algorithm to account for the different kinematics of this process with respect to the single 
charged scalar production: we set Aii max = 0.8 in the semi-leptonic top reconstruction and Ai? max = 1.1 
in the hadronic top reconstruction. We study this mode using a sample of 10 4 signal events and 10 6 ti 
background events. 



Cut Si Sh Bi Bh 

H T > 900 GeV 1500 4930 6.22 x 10 4 1.96 x 10 5 

3t u t 2 494 2840 2.37 x 10 4 1.20 x 10 5 

950 GeV < M tl , t2 < 1050 GcV 374 2380 5050 2.81 x 10 4 

TABLE IV: Effects of kinematic cuts on cross sections. Si and Sh denote the expected numbers of semi-leptonic and 
hadronic signal events after cuts for 100 fb" 1 of integrated luminosity; Bi and Bh denote the same for the background. 
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We present in TablellVlthe effect of the reconstruction on the signal and background processes assuming 
100 fb _1 . We demand H T > 900 GeV and require that our top reconstruction algorithm find two top 
candidates. We also demand that the invariant mass reconstruct to the scalar mass. The results for S/ \/~B 
and S/B obtained after each successive cut are shown in Table IVl we again note that the large significances 
obtained with only the H~t cut are illusory, as we believe the top tags are required to suppress other sources 
of background, at least in the fully hadronic mode. Our results indicate that observation is possible using 
only the semi-leptonic mode with 100 fb , the significance is over 10 if the hadronic events can be used. 
We present in Table IVTl a study of the estimated accuracy of the signal measurement using Eq. (IV. 15b . It is 
clear from the S/B values appearing in Table IVl that control of the systematic error in the background will 
be crucial. As it should be possible to extrapolate the background from the M tlt t 2 sidebands, we study the 
expected accuracy assuming small normalization uncertainties in the background ranging from 1-5%. If 
a 1 % normalization uncertainty of the background is possible, then a measurement of the r/bb coupling of 
24% accuracy using only semi-leptonic modes is possible. This improves to 14% if all top modes can be 
utilized. The estimated precision worsens dramatically as the uncertainty Fb is increased. 



Cut 


Si/VBl 


Si/Bi Si+h/y/Bi+h 


Si+h/Bi+h 


H T > 900 GcV 


6.0 


0.024 12.7 


0.025 




3.2 


0.021 8.8 


0.023 


950 GcV < M tlM < 1050 GcV 


5.3 


0.074 15.1 


0.083 



TABLE V: Obtained significances for the neutral scalar resonance production after successively imposing the cuts in 
TablellVI S;+h denotes the combination of semi-leptonic and fully hadronic signal events, and Bi+h denotes the same 
for the background. We assume 100 fb -1 of integrated luminosity. 



F B 


^ (semi) 

Vbb v ' 


§2M. ( a ii) 

Vbb v ' 





0.20 


0.07 


0.01 


0.24 


0.14 


0.025 


0.39 


0.31 


0.05 


0.70 


0.60 



TABLE VI: Accuracy of signal measurement for the neutral scalar resonances assuming various estimates of the 
theoretical errors in signal and background predictions. We have included results assuming 100 fb -1 of integrated 
luminosity for the choice rjjj = 1, r)o = 40. Included are results using only semi-leptonic events (labeled semi) and 
results obtained using all events (labeled all). 
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VI. CONCLUSIONS 

In this paper we have studied the production and detection of color-octet scalars at the LHC. We have 
focused on single production of both charged and neutral members of an (8, 2) ly / 2 doublet through bottom- 
quark initiated partonic processes. Such production modes probe the Yukawa structure of the scalar sector. 
These scalars appear in realistic SU(5) grand unified theories where the 45// representation is used to 
generate fermion masses for the charged fermions. One of their couplings is defined by the difference 
between the down quark and charged lepton Yukawa couplings. In the case of adjoint SU(5) these 
fields are expected to be light to satisfy constraints coming from unification and proton decay, and may 
have TeV-scale masses. Measuring the mass of the color-octet scalar gives an upper bound on the proton 
lifetime. Thus, the LHC may potentially be able to probe GUT-scale physics if these states are observed. 

We study observation of TeV-mass scalars using the two partonic modes bg — ► St — > tib and 
bb — > Srj — ► ti. We carefully analyze whether the signal can be seen over Standard Model top-quark 
backgrounds at the LHC, and devise a set of cuts based on the kinematics of the signal that facilitate obser- 
vation. As the final state contains boosted top quarks, we discuss their reconstruction in both semi-leptonic 
and hadronic decay modes. We find that observation of the charged scalar with S/ \f~B > 5 is possible 
with slightly more than 300 fb" 1 if only semi-leptonic tt modes are used, while less than 100 fb is 
needed if fully hadronic top decays can be utilized. Observation of the bb — > Srj — > ti channel requires 
less than 100 fb" 1 in both cases. We roughly estimate the effect of systematic errors on the measurement 
of the Yukawa parameters that lead to the signal cross section, and find obtainable precisions of 10-30%, 
depending on the size of the couplings. 

In summary, probing the color-octet scalar Yukawa structure through bottom-quark initiated processes at 
the LHC may potentially provide a window to GUT-scale physics, and create an interesting synergy between 
collider measurements and future proton-decay searches. 
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APPENDIX A: DECAY AND PRODUCTION FORMULAE 

We present in this Appendix the production and decay formulae for the charged and neutral scalars used 
in our analysis. We begin with the decay modes. The decay rates for Sr are 



T(S R 66) 



m s \yp\ 2 ml 

167T V 2 

3/2 



rvc +7\ ms M 2 ^! j -i 4 ™ 2 
167T 



™s j I^dI 2 ^ , |r?f7 | 2 rri,f f 1 4m| 

4 



3/2 



r ^ = Tfi l 2 + 2 T >■ (A-l) 
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The decay rates for 5/ are 
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The decay rate for the charged scalar is 



T{S + - tb) = 3 " UI 2 H mi - m )'. (A3) 



|2_2 

|W 

"S ' 

As we only study the charged scalar in the limit rj£> = 1, we neglect contributions from this coupling. 

We now present the partonic cross sections for the relevant processes. The differential partonic cross 
section for charged-scalar pair production via gluon fusion studied in Fig.|2]is 

dcr{gg -> S + S~) _ 9ira 2 ( s 2 - m 2 s s + m| + st - 2m%t + t 2 \ 

It "T^V (s + t-m 2 ) 2 (t-m 2 ) 2 J ( 

x ( s 2 t 2 + m 4 s s 2 + 2m%st + 2st 3 - 4m 2 s st 2 + t 4 
+ 6m|t 2 - Am%t + m%- 4m|i 3 ) , 

where the Mandelstam variable t = (p gi — Ps+) 2 = {Pg 2 ~ Ps-) 2 > an d s is the partonic center of mass 
energy. The differential partonic cross section of S + i production from gb scattering is 

da(gb — > S + t) _ a s \r]u\ 2 m 2 ( At 2 - st + As 2 - 8tm 2 + sm 2 + Ara\\ 

(A. 5) 



dt ' 72s 3 v 2 \ (t-m 2 ) 2 (s + t-m 2 ) 2 

st 2 - 2mfs + 
+ 2m 2 s t 2 + 2m A s m 2 t - 2m 2 s mf + m\t 2 - mft) , 



x ( m 2 t s 2 - s 2 t + 2m\ st - 2st 2 - 2m\s + 2m\st - 2m%t + ra\ - t 3 
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where now the Mandelstam variable t = (p g — pi) 2 = (pg — Ps+) 2 - The differential partonic cross section 
for resonant top-quark pair production mediated by the neutral octet scalar, bb — > Srj — ► tt, is as follows. 
The interference between the Sr and Sj exchanges vanishes, and the cross section conveniently separates 
into two simple expressions: 

da(bb ^ S R ^tt) 1 i2 m t m l 1 ( i 4m ? 
t. = =^\nnnn\ ; — tz. ttt^ ftt-tt 1 



dt ■ TJhr'^ 1 ^ (g-m|)2 + m|I* V 1 " ~ J ' (A " 6) 

where T # and T/ have been defined above. 
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